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Description 

BACKGROUND OF THE INVENTION 

With the advent of recombinant DNA technology, the controlled bacterial production of an enormous 

s variety of useful polypeptides has become possible. Already in hand are bacteria modified by this 
technology to permit the production of such polypeptide products such as somatostatin {K. Itakura, et a/., 
Science 198, 1056 [1977]), the (component) A and B chains of human insulin (D. V. Goeddel, etaL, Proc Nat'I 
Acad Sci, USA 76, 106 [1979]), and human growth hormone (D. V. Goeddel, etaL Nature 281, 544 [1979]). 
More recently, recombinant DNA techniques have been used to occasion the bacterial production of 

ro thymosin alpha 1, an immune potentiating substance produced by the thymus. Such is the power of the 
technology that virtually any useful polypeptide can be bacterially produced, putting within reach the 
controlled manufacture of hormones, enzymes, antibodies, and vaccines against a wide variety of diseases. 
The cited materials, which describe in greater detail the representative examples referred to above, are 
incorporated herein by reference, as are other publications referred to infra, to illuminate the background of 

is the invention. 

The work horse of recombinant DNA technology is the plasmid, a non-chromosomal loop of double- 
stranded DNA found in bacteria, oftentimes in multiple copies per bacterial cell. Included in the information 
encoded in the plasmid DNA is that required to reproduce the plasmid in daughter ceils (i.e., a "repltcon") 
and ordinarily, one or more selection characteristics, such as resistance to antibiotics, which permit clones 

20 of the host cell containing the plasmid of interest to be recognized and preferentially grown in selective 
media. The utility of bacterial plasmids lies in the fact that they can be specifically cleaved by one or 
another restriction endonuclease or "restriction enzyme", each of which recognizes a different site on the 
plasmidic DNA. Thereafter heterologous genes or gene fragments may be inserted into the plasmid by 
endwise joining at the cleavage site or at reconstructed ends adjacent the cleavage site. As used herein, the 

25 term "heterologous" refers to a gene not ordinarily found in, or a polypeptide sequence ordinarily not 
produced by, £. coli, whereas the term "homologous" refers to a gene or polypeptide which is produced in 
wild-type E. cqVi. DNA recombination is performed outside the bacteria, but the resulting "recombinant" 
plasmid can be introduced into bacteria by a process known as transformation and large quantities of the 
heterologous .gene-containing recombinant plasmid obtained by growing the transformant. Moreover, 

30 where the gene is properly inserted with reference to portions of the plasmid which govern the 
transcription and translation of the encoded DNA message, the resulting expression vehicle can be used to 
actually produce the polypeptide sequence for which the inserted gene codes, a process referred to as 
expression. 

Expression is initiated in a region known as the promoter which is recognized by and bound by RNA 

35 polymerase. In some cases, as in the trp operon discussed infra, promoter regions are overlapped by 
"operator" regions to form a combined promoter-operator. Operators are DNA sequences which are 
recognized by so-called repressor proteins which serve to regulate the frequency of transcription initiation 
at a particular promoter. The polymerase travels along the DNA, transcribing the information contained in 
the coding strand from its 5' to 3' end into messenger RNA which is in turn translated into a polypeptide 

40 having the amino acid sequence for which the DNA codes. Each amino acid is encoded by a unique 
nucleotide triplet or "codon" within what may for present purposes be referred to as the "structural gene", 
i.e. that part which encodes the amino acid sequence of the expressed product. After binding to the 
promoter, the RNA polymerase first transcribes nucleotides encoding a ribosome binding site, then a 
translation initiation or "start" signal (ordinarily ATG, which in the resulting messenger RNA becomes 

45 AUG), then the nucleotide codons within the structural gene itself. So-called stop codons are transcribed at 
the end of the structural gene whereafter the polymerase may form an additional sequence of messenger 
RNA which, because of the presence of the stop signal, will remain untranslated by the ribosomes. 
Ribosomes bind to the binding site provided on the messenger RNA, in bacteria ordinarily as the mRNA is 

being formed, and themselves produce the-encoded polypeptide? beginning at the translation start signal 

so and ending at the previously mentioned stop signal. The desired product is produced if the sequences 
encoding the ribosome binding site are positioned properly with respect to the AUG initiator codon and if 
all remaining codons follow the initiator codon in phase. The resulting product may be obtained by iysing 
the host cell and recovering the product by appropriate purification from other bacterial protein. 

Polypeptides expressed through the use of recombinant DNA technology may be entirely 

55 heterologous, as in the case of the direct expression of human growth hormone, or alternatively may 
comprise a heterologous polypeptide and, fused thereto, at least a portion of the amino acid sequence of a 
homologous peptide, as in the case of the production of intermediates for somatostatin and the 
components of human insulin. In the latter cases, for example, the fused homologous polypeptide 
comprised a portion of the amino acid sequence for beta galactosidase. In those cases, the intended 

so bioactive product is bioinactivated by the fused, homologous polypeptide until the latter is cleaved away in 
an extracellular environment. Fusion proteins like those just mentioned can be designed so as to permit 
highly specific cleavage of the precusor protein from the intended product, as by the action of cyanogen 
bromide on methionine, or alternatively by enzymatic cleavage. See, eg., G.B. Patent Publication No. 
2 007 676 A. 

65 If recombinant DNA technology is to fully sustain its promise, systems must be devised which optimize 
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expression of gene inserts, so that the intended polypeptide products can be made available in high yield. 
The beta lactamase and lactose promoter-operator systems most commonly used in the past, while useful, 
have not fully utilized the capacity of the technology from the standpoint of yield. A need has existed for a 
bacterial expression vehicle capable of the controlled expression of desired polypeptide products in higher 
5 yield. 

Tryptophan is an amino acid produced by bacteria for use as a component part of homologous 
polypeptides in a biosynthetic pathway which proceeds: chorismic acid—* anthranilic acid— > 

phosphoribosyl anthranilic acid CDRP [enol-1Ho-carboxyphenylamino)-1-desoxy-D-ribulose-5- 

phosphate] -> indol-3-glycerol-phosphate f and ultimately to tryptophan itseif. The enzymatic reactions of 

10 this pathway are catalyzed by the products of the tryptophan or "trp"operon, a polycistronic DNA segment 
which is transcribed under the direction of the trp promoter-operator system. The resulting polycistronic 
messenger RNA encodes the so-calied trp leader sequence and then, in order, the polypeptides referred to 
as trp E, trp D, trp C, trp B and trp A. These polypeptides variously catalyze and control individual steps in 
the pathway chorismic acid tryptophan. 

is In wild-type £ coli, the tryptophan operon is under at least three distinct forms of control. In the case of 

promoter-operator repression, tryptophan acts as a corepressor and binds to its aporepressor to form an 
active repressor complex which, in turn, binds to the operator, closing down the pathway in its entirety. 
Secondly, by a process of feedback inhibition, tryptophan binds to a complex of the trp E and trp D 
polypeptides, prohibiting their participation in the pathway synthesis. Finally, control is effected by a 

20 process known as attenuation under the control of the "attenuator region" of the gene, a region within the 
trp leader sequence. See generally G. F. Miozzari et ah J. Bacteriology 133, 1457 (1978); The Operon 
263—302, Cold Spring Harbor Laboratory (1978), Miller and Reznikoff, eds.; F. Lee et af, Proc. Natl Acad. 
Set. USA 74, 4365 (1977) and K. Bertrand ef ah J. Mol. Biol. 103, 319 (1976). The extent of attenuation 
appears to be governed by the intracellular concentration of tryptophan, and in wild-type 5. coli the 

25 attenuator terminates expression in approximately nine out often cases, possibly through the formation of 
a secondary structure, or "termination loop", in the messenger RNA which causes the RNA polymerase to 
prematurely disengage from the associated DNA. 

Other workers have employed the trp operon to obtain some measure of heterologous polypeptide 
expression. This work, it is believed, attempted to deal with problems of repression and attenuation by the 

30 addition of indole acrylic acid, an inducer and analog which competes with tryptophan for trp repressor 
molecules, tending toward derepression by competitive inhibition. At the same time the inducer 
diminishes attenuation by inhibiting the enzymatic conversion of indole to tryptophan and thus effectively 
depriving the cell of tryptophan. As a result more polymerases successfully read through the attentuator. 
However, this approach appears problematic from the standpoint of completing translation consistently 

35 and in high yield, since tryptophan-containing protein sequences are prematurely terminated in synthesis 
due to lack of utilizable tryptophan. Indeed, an effective relief of attenuation by this approach is entirely 
dependent on severe tryptophan starvation. 

The present invention addresses problems associated with tryptophan repression and attenuation in a 
different manner and provides (1) a method for obtaining an expression vehicle designed for direct 

40 expression of heterologous genes from the trp promoter-operator, (2) methods for obtaining vehicles 
designed for expression, from the tryptophan operator-promoter, of specifically cieavable polypeptides 
coded by homologous-heterologous gene fusions and (3) a method of expressing heterologous 
polypeptides controllably, efficiently and in high yield, as well as the associated means. 

45 SUMMARY OF THE INVENTION 

According to the present invention, novel plasmidic expression vehicles are provided for the 
production in bacteria of heterologous polypeptide products, the vehicles having a sequence of double- 
stran ded DNA.com prising, in phase from. a first_51_to_ajsacorvd_3ijead..of_the_coding_strand, a trp promoter- 
operator, nucleotides coding for the trp leader ribosome binding site, and nucleotides encoding translation 
so initiation for expression of a structural gene that encodes the amino acid sequence of the heterologous 
polypeptide. The DNA sequence referred to contains neither a trp attenuator region nor nucleotides coding 
for the trp E ribosome binding site. Instead, the trp leader ribosome binding site is efficiently used to effect 
expression of the information encoded by an inserted gene. 

Cells are transformed by addition of the trp promoter-operator-containing and attenuator-lacking 
55 plasmids of the invention and grown up in the presence of additive tryptophan. The use of tryptophan- rich 
media provides sufficient tryptophan to essentially completely repress the trp promoter-operator through 
trp/repressor interactions, so that cell growth can proceed uninhibited by premature expression of large 
quantities of heterologous polypeptide encoded by an insert otherwise under the control of the trp 
promoter-operator system. When the recombinant culture has been grown to the levels appropriate for 
go industrial production of the polypeptide, on the other hand, the external source of tryptophan is removed, 
leaving the cell to rely only on the tryptophan that it can itself produce. The result is mild tryptophan 
limitation and, accordingly, the pathway is derepressed and highly efficient expression of the heterologous 
insert occurs, unhampered by attenuation because the attenuator region has been deleted from the 
system. In this manner the cells are never severely deprived of tryptophan and all proteins, whether they 
55 contain tryptophan or not, can be produced in substantial yields. 
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The manner in which these and other objects and advantages of th invention are obtained will 
become more apparent from the detailed description which follows and from the accompanying drawings. 
The illustrated embodiments make use of inventions which form the subject of European Patents 
EP — 81 — 36776 and EP — B1 — 86548. In the drawings: 

Figures 1 and 2 illustrate a preferred scheme for forming piasmids capable of expression heterologous 
genes as fusions with a portion of the trp D polypeptide, from which fusion they may be later cleaved; 

Figure 3 is the result of polyacrylamide gel segregation of cell protein containing homologous (trp D') 

— heterologous (somatostatin or thymosin a 1) fusion proteins; 

Figures 4, 5 and 6 illustrate successive stages in a preferred scheme for the creation of a plasmid 
capable of directly expressing a heterologous gene (human growth hormone) under the control of the trp 
promoter-operator system; 

Figure 7 is the result of polyacrylamide gel segregation of cell protein containing human growth 
hormone directly expressed under the control of the trp promoter-operator system; 

Figures 8, 9 (a — b) and 10 illustrate in successive stages a preferred scheme for the creation of piasmids 
capable of expressing heterologous genes (in the illustrated case, for somatostatin) as fusions with a 
portion of the trp E polypeptide, from which fusions they may be later cleaved; 

Figure 11 is the result of polyacrylamide gel segregation of cell protein containing homologous (trp E) 

— heterologous fusion proteins for the production of, respectively, somatostatin, thymosin alpha 1, human 
proinsulin, and the A and B chains of human insulin. 

Figures 12 and 13 illustrate in successive stages the manner in which the plasmid created by the 
scheme of Figures 8 — 10 inclusive is manipulated to form a system in which other heterologous genes may 
be interchangeably expressed as fusions with trp E polypeptide sequences. 

In the Figures, only the coding strand of the double-stranded plasmid and linear DNAs are depicted in 
most instances, for clarity in illustration. Antibiotic resistance-encoding genes are denoted ap R (ampicillin) 
and tc R (tetracycline). The legend tc s connotes a gene for tetracycline resistance that is not under the 
control of a promoter-operator system, such that piasmids containing the gene will nevertheless be 
tetracycline sensitive. The legend "ap s " connotes ampicillin sensitivity resulting from deletion of a portion 
of the gene encoding ampicillin sensitivity. Plasmidic promoters and operators are denoted "p" and "o". 
The letters A„ T, G and C respectively connote the nucleotides containing the bases adenine, thymine, 
guanine and cytosine. Other Figure legends appear from the text. 

The preferred embodiments of the invention described below involved use of a number of commonly 
available restriction endonucleases next identified, with their corresponding recognition sequences and 
(indicated by arrow) cleavage patterns. 



35 



Xbal 



EcoRI 



CTAGA 



AGATC^T 
GAATTC 



TaqI: 



TCGA 
AGCjT 



Hindlll: AAGCTT 



45 



SO 



Bglll: 



PvuII 



CTTAAG 
t 

/gatct 

TCTAGA 
t 

GAGCTG 



Hpal: 



PstI: 



TTCGAA 
t 

GTTAAC 



CAATTG- 
t 

CTGCAG 



60 



3amHt: 



GTCGAC 
t 

I 

GGATCC 

CCTAGG 
t 



GACGTC 
t 



Where the points of cleavage are spaced apart on the respective strands the cleaved ends will be "sticky", 
ie, capable of reannealmg or of annealing to other complementarily "sticfc"-ended ONA by Watson^Crick 
65 base pairing (A to T and G to C) in mortise and tenon fashion. Some restriction enzymes, such as Hpal and 
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Pvull above, cleave to leav "blunt" ends. The nucleotide sequences above are represented in accordance 
with the convention used throughout: upper strand is the protein encoding strand, and in proceeding from 
left to right on that strand one moves from the 5' to the 3' end thereof, ie, in the direction of transcription 
from a "proximal" toward a "distal" point. 

s Finally with regard to conventions, the symbol "A" connotes a deletion. Thus, for example, reference 
to a plasmid followed by, say, "AEcoRI-Xbal" describes the plasmid from which the nucleotide sequence 
between EcoRI and Xbat restriction enzyme sites has been removed by digestion with those enzymes. For 

convenience, jcertain deletions are denoted by number. Thus, beginning from the first base pair ("bp") of 

the EcoRI recognition site which precedes the gene for tetracycline resistance in the parental plasmid 

io pBR322, "A1" connotes deletion of bp1 — 30 (ie, AEcoRI-Hind III) and consequent disenabling of the 
tetracycline promoter-operator system; "A2" connotes deletion bp 1 — 375 (ie, AEcoRI-BamHI) and 
consequent removal of both the tetracycline promoter-operator and the structural gene which encodes 
tetracycline resistance; and "A3" connotes deletion of bp 3611—4359 (ie, APstl-EcoRI) and elimination of 
amptcillin resistance. "A4" is used to connote removal of bp —900 1500 from the trp operon fragment 5 

is (Rg. 1), eliminating the structural gene for the trp D polypeptide. 

DETAILED DESCRIPTION 

The trp leader sequence is made up of base pairs ("bp") 1 — 162, starting from the start point for trp 
mRNA. A fourteen amino acid putative trp leader polypeptide is encoded by bp 27 — 71 following the ATG 

20 nucleotides which encode the translation start signal. The trp attenuator region comprises successive GC- 
rich and AT-rich sequences lying between bp 1 14 and 156 and attenuation is apparently effected on mRNA 
nucleotides encoded by bp —134 — 141 of the leader sequence. To express a heterologous polypeptide 
under the direction of the trp leader ribosome binding site and at the same time avert attenuation, the 
following criteria must be observed: 

23 1. Base pairs 134 — 141 or beyond must be deleted; 

2. The ATG codon of the inserted gene must be positioned in correct relation to a ribosome binding 
site, as is known (see, eg., J. A. Steitz "Genetic signals and nucleotide sequences in messenger RNA" in 
Biological Regulation and Control (ed. R. Goldberger) Plenum Press, N.Y. (1978). 

3. Where a homologous-heterologous fusion protein is to be produced, the translation start signal of a 
30 homologous polypeptide sequence should remain available, and the codons for the homologous portion of 

the fusion protein have to be inserted in phase without intervening translation stop signals. 

For example, deleting all base pairs within the leader sequence distal from bp. 70 removes the 
attenuator region, leaves the ATG sequence which encodes the translation start signal, and eliminates the 
intervening translation stop encoded by TCA (bp. 69 — 71), by eliminating A and following nucleotides. 

35 Such a deletion would result in expression of a fusion protein beginning with the leader polypeptide, 
ending with that encoded by any heterologous insert, and including a distal region of one of the post-leader 
trp operon polypeptides determined by the extent of the deletion in the 3' direction. Thus a deletion 
extending into the E gene would lead to expression of a homologous precursor comprising the L sequence 
and the distal region of E (beyond the deletion endpoint) fused to the sequence encoded by any following 

4o insert, and so on. 

Two particularly useful plasmids from which the attenuator region has been deleted are the plasmids 
pGM1 and pGM3, G. F. Miozzari et a/, J. Bacteriology 133, 1457 (1978). These respectively carry the 
deletions trp ALE 1413 and trp ALE 1417 and express (under the control of the trp promoter-operator) a 
polypeptide comprising approximately the first six amino acids of the trp leader and distal regions of the E 

45 polypeptide. In the most preferred case, pGM1, only about the last third of the E polypeptide is expressed 
whereas pGM3 expresses almost the distal one half of the E polypeptide codons. E. colt K — 12 strain W31 10 
tna 2~frp~A102 containing pGMI has been deposited with the American Type Culture Collection (ATCC no. 
31622). pGM1 may be conventionally removed from the strain for use in the proced ures described below. 
Alternatively, deletions may be effected by the method disclosed in EP — B1 — 086548 for specifically 

so cleaving double-stranded DNA at any desired site. One example of this cleavage technique appears from 
Part IV of the experimental section, infra. Thus, double-stranded DNA is converted to single-stranded DNA 
in the region surrounding the intended cleavage point, as by reaction with lambda exonuclease. A synthetic 
or other single-stranded DNA primer is then hybridized to the single-stranded length earlier formed, by 
Watson-Crick base-pairing, the primer sequence being such as to ensure that the 5' end thereof will be 

55 coterminous with the nucleotide on the first strand Just prior to the intended cleavage point. The primer is 
next extended in the 3' direction by reaction with DNA polymerase, recreating that portion of the original 
double-stranded DNA prior to the intended cleavage that was lost in the first step. Simultaneously or 
thereafter, the portion of the first strand beyond the intended cleavage point is digested away. To 
summarize, where "v" marks the intended cleavage point: 

so 



a) v intended cleavage point H v M 

65 . . • . . . 
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b) 
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v made single stranded 

around "v" 



5 

c) ....... v primer hybridization 



10 

rj) v extension from primer 

^AAA/ 



e) v single strand digestion 

20 



In the most preferred embodiment, steps (d) and {e) are performed simultaneously, using a polymerase 
that simultaneously digests the protruding single stranded end in the 3' — » 5' direction and extends the 
25 primer (in the presence of dATP, dGTP, dTTP and dCTP) in the 5' -* 3' direction. The material preferred for 
this purpose is Klenow Polymerase I, ie, that fragment obtained by proteolytic cleavage of DNA Polymerase 
I which contains the 5' 3' polymerizing activity and the 3' -* 5' exonucleolytic activity of the parental 
enzyme, yet lacks its 5' ~» 3' exonucleolytic activity. A. Kornberg, DNA Synthesis, 98, W. H, Freeman and 
Co., SFO (1974). 

30 Using the procedure just described, attenuator deletions may be made in any desired manner in a trp 

operon-containing plasmid first linearized by, eg, cleavage at a restriction site downstream from the point 
at which the molecule is to be blunt-ended ("v" above). Recircularization following deletion of the 
attenuator region may be effected, eg, by blunt end ligation or other manners which will be apparent to the 
art-skilled. 

3s Although the invention encompasses direct expression of heterologous polypeptide under the 
direction of the trp promoter-operator, the preferred case involves expression of fused proteins containing 
both homologous and heterologous sequences, the latter preferably being specifically cleavable from the 
former in extra-cellular environs. Particularly preferred are fusions in which the homologous portion 
comprises one or more amino acids of the trp leader polypeptide and about one-third or more of the trp E 

40 amino acid sequence (distal end). Fusion proteins so obtained appear remarkably stabilized against 
degradation under expression conditions. 

Bacteria £ colt K — 12 strain W31 10 tna 2~frp~A102 (pGMI), ATCC No. 31622, may be used to amplify 
stocks of the pGM1 plasmid preferably employed in constructing the attenuator-deficient trp promoter- 
operator systems of the invention. This strain is phenotypicaily trp+ in the presence of anthranilate and can 

45 be grown in minimal media such as LB supplemented with 50 jag/ml anthranilate. 

All bacterial strains used in trp promoter-operator directed expression according to the invention are 
trp repressor" 1 * ("trp R + ") as in the case of wild-type E. coli, so as to ensure repression until heterologous 
expression is intended. 

DNA recombination isrin the-preferrechembodiment performed in £ coli, K — 1 2 strain 294 (encUA^thi^- 
50 hsr", hsm^), ATCC No. 31446, a bacterial strain whose membrane characteristics facilitate transformations. 
Heterologous polypeptide-producing plasmids grown in strain 294 are conventionally extracted and 
maintained in solution (eg, 10 mM tris, 1 mM EDTA, pH 8) at from about -20°C to about 4°C. For expression 
under industrial conditions, on the other hand, we prefer a more hardy strain, ie, £ coli K — 12 \~F~ RV 308 
str r , gal 308" ATCC No. 31608. RV 308 is nutritionally wild-type and grows well in minimal media, 
55 synthesizing all necessary macromolecules from conventional mixes of ammonium, phosphate and 
magnesium salts/trace metals and glucose. After transformation of RV 308 culture with strain 294-derived 
plasmid the culture is plated on media selective for a marker (such as antibiotic resistance) carried by the 
plasmid, and a transformant colony picked and grown in flask culture. Aliquots of the latter in 10% DMSO 
or glycerol solution (in sterile Wheaton vials) are shell frozen in an ethanoi-dry ice bath and frozen at -80°C. 
so To produce the encoded heterologous polypeptide the culture samples are grown up in media containing 
tryptophan so as to repress the trp promoter-operator and the system then deprived of additive tryptophan 
to occasion expression. 

For the first stage of growth one may employ, for example, LB medium (J. H. Miller, Experiments in 
Molecular Genetics, 433, Cold Spring Harbor Laboratory 1972) which contains, per liter aqueous solution, 
65 10 g Bacto tryptone, 5g Bacto yeast extract and 10 g NaCi. Preferably, the inoculant is grown to optical 
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density ("o.d.") of 10 or more (at 550 nM), more preferably to o.d. 20 or more, and most preferably to o.d. 
30 or more, albeit to less than stationary phase. 

For depression and expression the inoculant is next grown under conditions which deprive the cell of 
additive tryptophan. One appropriate media for such growth is M9 (J. H. Miller, supra et 431) prepared as 
5 follows (per liter): 

3g 

--6g 

0.5 g 

1 g 



CaCI 2 

MgS0 4 

glucose 

1 ug/ml 

Humko hycase amino 

or DIFCO cas. amino acids 40 ug/ml. 

25 

The amino acid supplement is a tryptophan-lacking acid hydrolysate of casein. 

To commence expression of the heterologous polypeptide the inoculant grown in tryptophan-rich 
media may, eg, be diluted into a larger volume of medium containing no additive tryptophan (for example, 
2—10 fold dilution) grown up to any desired level (preferably short of stationary growth phase) and the 

30 intended product conventionally obtained by lysis, centrifugation and purification. In the tryptophan- 
deprived growth stage, the ceils are preferably grown to od in excess of 1 0, more preferably in excess of od 
20 and most preferably to or beyond od 30 (all at 550 nM) before product recovery. 

Ail DNA recombination experiments described in the Experimental section which follows were 
conducted at Genentech Inc. in accordance with the National Institutes of Health Guidelines for 

35 Recombinant DNA research. 



KH 2 P0 4 

Na 2 HPQ 4 

w NaCI 

NH 4 CI 

Autoclave, then add: 
15 10 ml 0.01 M 

1 ml 1M 

10 ml 20% 

20 

Vitamin B1 



I. Expression of D-polypeptide fusion protein 

A preferred method of expressing fusion proteins comprising desired polypeptides and, fused thereto, 
a portion of the amino acid sequence of the trp D polypeptide that is separable in vitro by virtue of a 
40 methionine amino acid specifically sensitive to CNBr cleavage, is described with reference to Figures 1—3. 

A. Construction of pBRHtrp 

Plasmid pGM1 (7, Fig. 1) carries the £. coii tryptophan operon containing the deletion ALE1413 (G. F. 
Miozzari, eta/., (1978) J. Bacteriology 1457— -1466)) and hence expresses a fusion protein comprising the 

45 first 6 amino acids of the trp leader and approximately the last third of the trp E polypeptide (hereinafter 
referred to in conjunction as LE'), as well as the trp D polypeptide in its entirety, all under the control of the 
trp promoter-operator system. The plasmid, 20 ug, was digested with the restriction enzyme Pvull which 

cleaves the plasmid at five sites. The gene-fragments-2-were-next-combined with EcoRI linkers (consisting 

of a self complementary oligonucleotide 3 of the sequence: pCATG AATTCATG ) providing an EcoRI 

so cleavage site for a later cloning into a plasmid containing an EcoRI site (20). The 20 [ig of DNA fragments 2 
obtained from pGM1 were treated with 10 units T 4 DNA ligase in the presence of 200 pico moles of the 5'- 
phosphorylated synthetic oligonucleotide pCATGAATTCATG (3) and in 20 ul T 4 DNA ligase buffer (20 mM 
tris, pH 7.6, 0.5 mM ATP, 10 mM MgCi 2 , 5 mM dithiothreitol) at 4°C overnight. The solution was then heated 
10 minutes at 70°C to halt ligation. The linkers were cleaved by EcoRI digestion and the fragments, now 

55 with EcoRI ends were separated using 5 percent polyacrylamide gel electrophoresis (herein after "PAGE") 
and the three largest fragments isolated from the gel by first staining with ethidium bromide, locating the 
fragments with ultraviolet light, and cutting from the gel the portions of interest. Each gel fragment with 
300 microliters 0.1 x TBE, was placed in a dialysis bag and subjected to electrophoresis at 100 v for one 
hour in 0.1 x TBE buffer (TBE buffer contains: 10.8 gm tris base, 5.5 gm boric acid, 0.09 gm Na 2 EDTA in 1 

so liter H 2 0). The aqueous solution was collected from the dialysis bag, phenol extracted, chloroform 
extracted and made 0.2 M sodium chloride, and the DNA recovered in water after ethanol precipitation. [All 
DNA fragment isolations hereinafter described are performed using PAGE followed by the electroelution 
method just discussed]. The trp promoter-operator-containing gene with EcoRI sticky ends 5 was identified 
in the procedure next described, which entails the insertion of fragments into a tetracycline sensitive 

65 plasmid 6 which, upon promoter-operator insertion, becomes tetracycline resistant. 
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B. Creation of the plasmid pBRHtrp expressing tetracycline resistance under th control of the trp 
promoter-operator and identification and amplification of the trp promoter-op rator containing DNA 
fragment 5 isolated in (A.) above. 

Plasmid pBRH1 (6), {R. I. Rodriguez, et aL, Nucleic Acids Research 5, 3267—3287 [1979]) expresses 

5 ampicilin resistance and contains the gene for tetracycline resistance but, there being no associated 
promoter, does not express that resistance. The plasmid is accordingly tetracycline sensitive. By 
introducing a promoter-operator system in the EcoRI site, the plasmid can be made tetracycline resistant. 

pBRH1 was digested with EcoRI and the enzyme removed by phenol extraction followed by chloroform 
extraction "and "recovered in water after ethanol precipitation. The resulting DNA molecule 7 was, in 

w separate reaction mixtures, combined with each of the three DNA fragments obtained in part A. above and 
ligated with T 4 DNA ligase as previously described. The DNA present in the reaction mixture was used to 
transform competent £. coll K — 12 strain 294, K. Backman et aL, Proc Nat'l Acad Sci USA 73, 4174 — 4198 
[1976]) (ATCC no. 31448) by standard techniques (V. Hershfield et aL, Proc Nat'l Acad Sci USA 77, 
3455—3459 [1974]) and the bacteria plated on LB plates containing 20 ng/ml ampicillin and 5 ug/ml 

75 tetracycline. Several tetracycline-resistant colonies were selected, plasmid DNA isolated and the presence 
of the desired fragment confirmed by restriction enzyme analysis. The resulting plasmid 5, designated 
pBRHtrp, expresses p-lactamase, imparting ampicillin resistance, and it contains a DNA fragment including 
the trp promoter-operator and encoding a first protein comprising a fusion of the first six amino acids of the 
trp leader and approximately the last third of the trp E polypeptide (this polypeptide is designated LE'), and 

20 a second protein corresponding to approximately the first half of the trp D polypeptide (this polypeptide is 
designated D'), and a third protein coded for by the tetracycline resistance gene. 

C. Cloning genes for various end-product polypeptides and expression of these as fusion proteins 
comprising end-product and specifically cieavabletrp D polypeptide precursor (Figure 2). 

25 A DNA fragment comprising the trp promoter-operator and codons for the LE' and D' polypeptides was 
obtained from plasmid pBRHtrp and inserted into plasmids containing structural genes for various desired 
polypeptides, next exemplified for the case of somatostatin (Figure 2). 

pBRH trp was digested with EcoRI restriction enzyme and the resulting fragment 5 isolated by PAGE 
and electroeiution. EcoRi-digested plasmid pSom 11 (K. Itakura et al. Science 198, 1056 (1977); G.B. patent 

30 publication no. 2 007 676 A) was combined with fragment 5. The mixture was ligated with T A DNA ligase as 
previously described and the resulting DNA transformed into £. coll K — 12 strain 294 as previously 
described. Transformant bacteria were selected on ampicillin-containing plates. Resulting ampicillin- 
resistant colonies were screened by colony hybridization (M. Gruenstein et aL, Proc Nat'l Acad Sci USA 72, 
3951 — 3965 [1975]) using as a probe the trp promoter-operator-containing fragment 5 isolated from 

35 pBRHtrp, which had been radioactively labelled .with P 32 . Several colonies shown positive by colony 
hybridization were selected, plasmid DNA was isolated and the orientation of the inserted fragments 
determined by restriction analysis employing restriction enzymes Bglll and BamHI in double digestion. E. 
co// 294 containing the plasmid designated pSOM7A2, 7 7, which has the trp promoter-operator fragment in 
the desired orientation was grown in LB medium containing 10 tig/ml ampicillin. The cells were grown to 

40 optical density 1 (at 550 nM), collected by centrifugation and resuspended in M9 media in tenfold dilution. 
Cells were grown for 2 — 3 hours, again to optical density 1, then iysed and total cellular protein analyzed by 
SDS (sodium dodcyl sulfate) urea (15 percent) pofyacrylamide gel electrophoresis {J. V. Maizel Jr. era/., 
Meth Viral 5, 180—246 [1971]). 

Figure 3 illustrates a protein gel analysis in which total protein from various cultures is separated by 

45 size. The density of individual bands reflects the quantity in which the respective proteins are present. With 
reference to Figure 3, lanes 1 and 7 are controls and comprise a variety of proteins of previously 
determined size which serve as points of comparative reference. Lanes 2 and 3 segregate cellular protein 
from colonies of E. coll 294 transformed with plasmid pSom7 A2 and respectively grown in LB (lane 2) and 

M9-(iane-3)-media-Lanes 4 and 5 segregate cellular protein obtained-from-similar-cells-transformed with 

so the plasmid pTha7 A2, a thymosin expression plasmid obtained by procedures essentially identical to 
those already described, beginning with the plasmid pThal (see EP 35454 and also Wetzel et at., 
Biochemistry, 19: 6069 (1980)). Lane 4 segregates cellular protein from E. colt 294/pTha7A2 grown in LB 
media, whereas lane 5 segregates cell protein from the same transformant grown in M9 media. Lane 6, 
another control, is the protein pattern of E coli 294/pBR322 grown in LB. 
55 Comparison to controls shows the uppermost of the two most prominent bands in each of lanes 3 and 
5 to be proteins of size anticipated in the case of expression of a fusion protein comprising the D' 
polypeptide and, respectively, somatostatin and thymosin (the other prominent band represents the LE' 
polypeptide resulting from deletion of the attenuator). Figure 3 confirms that expression is repressed in 
tryptophan-rich media, but derepressed under tryptophan deficient conditions. 

60 

D. Cyanogen bromide cleavage and radioimmunoassay for hormone product 

For both the thymosin and somatostatin cases, total cellular protein was cyanogen bromide-cleaved, 
the cleavage product recovered and, after drying, was resuspended in buffer and analyzed by 
radioimmunoassay, confirming the expression of product immunologically identical, respectively, to 

65 
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somatostatin and thymosin. Cyanogen bromide cleavage was as described in D. V. Goeddel et aL, Proc 
Nat'l Acad Sci USA 76, 106—110 [1979]). 

II. Construction of plasmids for direct expression of heterologous genes under control of the trp promoter- 
5 operator system 

The strategy for direct expression entailed creation of a plasmid containing a unique restriction site 
distal from all control elements of the trp operon into which heterologous genes could be cloned in lieu of 
the trp leader sequence and in proper, spaced relation to the trp leader polypeptide's ribosome binding 
site. The direct expression approach is next exemplified for the case of human growth hormone 
jo expression. 

The plasmid pSom7 A2, 10 ug, was cleaved with EcoRI and the DNA fragment 5 containing the 
tryptophan genetic elements was isolated by PAGE and electroelution. This fragment, 2 ug, was digested 
with the restriction endonuciease Taq I, 2 units, 10 minutes at 37°C such that, on the average, only one of 
the approximately five Taq I sites in each molecule is cleaved. This partially digested mixture of fragments 
is was separated by PAGE and an approximately 300 base pair fragment 12 (Fig. 4) that contained one EcoRI 
end and one Taq i end was isolated by electroelution. The corresponding Taq I site is located between the 
transcription start and translation start sites and is 5 nucleotides upstream from the ATG codon of the trp 
leader peptide. The DNA sequence about this site is shown in Figure 4. By proceeding as described, a 
fragment could be isolated containing all control elements of the trp operon, i.e., promoter-operator 
20 system, transcription initiation signal, and trp leader ribosome binding site. 

The Taq I residue at the 3' end of the resulting fragment adjacent the translation start signal for the trp 
leader sequence was next converted into an Xbal site, as shown in Figure 5. This was done by ligating the 
fragment 12 obtained above to a plasmid containing a unique (i.e., only one) EcoRI site and a unique Xbal 
site. For this purpose, one may employ essentially any plasmid containing, in order, a replicon, a selectable 
25 marker such as antibiotic resistance, and EcoRI, Xbal and BamHI sites. Thus, for example, an Xbal site can 
be introduced between the EcoRI and BamHI sites of pBR322 (F. Bolivar et aL, Gene 2, 95 — 119 (1977]) by, 
e.g., cleaving at the plasmid's unique Hind 111 site with Hind 111 followed by single strand-specific nuclease 
digestion of the resulting sticky ends, and blut end ligation of a self annealing double-stranded synthetic 
nucleotide containing the recognition site such as CCTCTAGAGG. Alternatively, naturally derived DNA 
30 fragments may be employed,as was done in the present case, that contain a single Xbal site between EcoRI 
and BamHI cleavage residues. Thus, an EcoRI and BamHI digestion product of the viral genome of hepatitis 
B was obtained by conventional means and cloned into the EcoRI and BamHI sites of plasmid pGH6 (D. V. 
Goeddel et aL, Nature 257, 544 [1979]) to form the plasmid pHS32. Plasmid pHS32 was cleaved with Xbal, 
phenol extracted, chloroform extracted and ethanol precipitated. It was then treated with 1 u.1 E. coli 
35 polymerase I, Klenow fragment (Boehringer-Mannheim) in 30 ul polymerase buffer (50 mM potassium 
phosphate pH 7.4, 7 mM MgCI 2 , 1 mM p-mercaptoethanol) containing 0.1 mM dTTP and 0.1 mM dCTP for 
30 minutes at 0 a C then 2 hr. at 37°C. This treatment causes 2 of the 4 nucleotides complementary to the 5' 
protruding end of the Xbal cleavage site to be filled in: 

40 5' CTAGA 5' CTAGA 

3 l T > V TCT 



45 Two nucleotides, dC and dT, were incorporated giving an end with two 5' protruding nucleotides. This 
linear residue of plasmid pHS32 (after phenol and chloroform extraction and recovery in water after 
ethanol precipitation) was cleaved with EcoRI. The large plasmid fragment 13 was separated from the 
smaller EcoRI-Xbal fragme nt by PAGE_and isolated after electroelution . This DNA fra g ment from p H S32 
(0.2 pg), was tigated, under conditions similar to those described above, to the EcoRl-Taq I fragment of the 

so tryptophan operon (—0.01 ug), as shown in Figure 5. tn this process the Taq I protruding end is ligated to 
the Xbal remaining protruding end even though it is not completely Watson-Crick base-paired: 

T + CTAGA T CTAGA 

55 AGC TCT AGCTCT 

A portion of this ligation reaction mixture was transformed into E. coli 294 cells as in part I. above, heat 
treated and plated on LB plates containing ampicillin. Twenty-four colonies were selected, grown in 3 ml LB 
media, and plasmid isolated. Six of these were found to have the Xbal site regenerated via E. coli catalyzed 
60 DNA repair and replication: 

T CTAGA T CTAGA 

AGCTCT AGATCT 

65 
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These plasmids were also found to cleave both with EcoRI and Hpal and to give the expect d ^"J™ 
fragments On plasmid 14, designated pTrp 14, was used for expression of heterologous polypept.des. as 

^^rpTasmid pHGH !07 (?3 in Figure 6, D. V. Goedde. et .l r Nsture 281. 544. <^> "£ s ^ r 
human growth hormone made up of 23 amino acid codons produced from synthetic DNA laments and 
163 amino acid codons obtained from complementary DNA produced v. a reverse transcription of human 
growth hormone messenger RNA. This gene 21, though it lacks the codons of the pre sequence iof 
human growth hormone, does contain an ATG translation Initiation codon. The gene was isolated from 
10 ug pHGH 107 after treatment with EcoRI followed by E. coli polymerase I Klenow fragment and dTTP and 
dATP as described above. Following phenol and chloroform extraction and ethanol precipitation the 
plasmid was treated with BamHI. See Figure 6. . „.„ f -„ n ^ oH h „ 

The human growth hormone ("HGH") gene-containing fragment 21 was isolated by PAGE followed by 
electroelution. The resulting DNA fragment also contains the first 350 nucleotides of the tetracycline 
resistance structural gene, but lacks the tetracyline promoter-operator system "to*^^™^"*™* 
cloned into an expression plasmid, plasmids containing the insert can be located by the reatorat of 
tetracycline resistance. Because the EcoRI end of the fragment 21 has been filled in by the Klenow 
polymerase I procedure, the fragment has one blunt and one sticky end, ensur.ng proper orientation when 
later inserted into an expression plasmid. See Figure 6. * • • * * 

The expression plasmid P Trp14 was next prepared to receive the , HGH I gene-conta.nmg I fragment 
prepared above. Thus, P Trp14 was Xbal digested and the resulting sticky ends fiUed in with the Klenow 
polymerase I procedure .employing dATP, dTTP, dGTP and dCTP. After phenol and chloroform extraction 
and ethanol precipitation the resulting DNA 16 was treated with BamHI and the resulting large plasmid 
fragment 17 isolated by PAGE and electroelution. The pTrp14-derived fragment 17 had one blunt and one 
sticky end, permitting recombination in proper orientation with the HGH gene containing fragment 21 

P re ™ usi Y QH^^fragment 21 and the P Trp14 AXba-BamHl fragment 17 were combined and ligated 
together under conditions similar to those described above. The filled in Xbal and EcoRI ends ligated 
together by blunt end ligation to recreate both the Xbal and the EcoRI site: 

TCTAG + AATTCTATG t |cTAG AAIICTATG 

' AGATC TTAAGATAC ^ AGATcJrTAAjSATAC 

Xbal EcoRI 

This construction also recreates the tetracycline resistance gene. Since the plasmid pHGH 107 expresses 
tetracycline resistance from a promoter lying upstream from the HGH gene (the lac promoter), this 
construction 22, designated pHGH 207, permits expression of the gene for tetracycline resistance under the 
control of the tryptophan promoter-operator. Thus the ligation mixture was transformed into E. coli 294 
and colonies selected on LB plates containing 5 ug/ml tetracycline. 

In order to confirm the direct expression of human growth hormone from plasmid prion 207, total 
cellular protein derived from E. coli 294/pHGH 207 that had been grown to optical density 1 in LB media 
containing 10 ug/ml ampicillin and diluted 1 to 10 into M9 media, and grown again to optical density 1, was 
subjected to SDS gel electrophoresis as in the case of part I. above and compared to similar electrophoresis 
data obtained for human growth hormone as previously expressed by others (D. V. Goeddel et al, Nature, 
281, 544 (1979)). Figure 7 is a photograph of the resulting, stained gel wherein: Lanes 1 and 7 contain 
protein markers of various known sizes; Lane 2 is a control that separates total cellular protein of E. Coli 
strain 294 pBR322; Lane 3 segreg ates prote in from E. Coli 294/pHGH 107 grown in LB media; Lane 4 



segregates protein from E. Coli 294/pHGH 107 grown in M9 media; Lane 5 segregates protein from E. Coir 
294/pHGH 207 grown in LB media; and Lane 6 segregates protein from E. Coli 294/pHGH 207 grown in M9. 
The dense band in Lane 6 is human growth hormone, as shown by comparison to the similar bands in 
Lanes 2—4. As predicted by the invention, the organism E. Coli 294/pHGH 207 when grown in tryptophan- 
rich LB media produces less human growth hormone by reason of tryptophan repressor/operator 
interactions, and when grown in M9 media produces considerably more HGH than E. Coli 294/pHGH 107 
owing to the induction of the stronger tryptophan promoter-operator system vs the lac promoter-operator 
system in pHGH 107. 

111. Creation of a general expression plasmid for the direct expression of heterologous genes under control 
of the tryptophan promoter-operator. .... . 

The plasmid pHGH 207 created in the preceding section was next used to obtain a DNA fragment 
containing the control elements of the tryptophan operon (with the attenuator deleted) and to create a 
plasmid "expression vector" suitable for the direct expression of various structural gene inserts. The 
strategy for creation of the general expression plasmid involved removal of the tryptophan contro region 
from pHGH 207 by EcoRI digestion and insertion in the EcoRi-digested plasmid pBRH1 used in part I. supra. 
pBRH1, as previously noted, is an ampicillin resistant plasmid containing the tetracycline resistance gene 
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but is tetracyclin sensitive because of the absenc of a suitable promoter-operator system. The resulting 
plasmid, pHKY 1, whose construction is more particularly described below and shown in Figure 8, is both 
ampicillin and tetracycline resistant, contains the tryptophan promoter-operator system, lacks the 
tryptophan attenuator, and contains a unique Xbal site distal from the tryptophan promoter-operator. The 
5 tryptophan promoter-operator and unique Xbal site are bounded by EcoRI sites, such that the promoter- 
ope rator-Xbal-containing fragment can be removed for insertion in other structural gene-containing 
plasmids. 

Alternatively, heterologous structural genes may be inserted, either into the Xbal site or (after partial 

EcoRI digestion) into the EcoRI site distal from the tryptophan control region, in either case so as to come 

w under the control of the tryptophan promoter-operator system. 

Plasmid pHGH 207 was EcoRI digested and the trp promoter containing EcoRI fragment 23 recovered 
by PAGE followed by electroelution. 

Plasmid pBRH1 was EcoRI digested and the cleaved ends treated with bacterial alkaline phosphatase 
{"BAP") (1 ug, in 50 mM tris pH 8 and 10 mM MgCI 2 for 30 min. at 65°C) to remove the phosphate groups 

is on the protruding EcoRI ends. Excess bacterial alkaline phosphatase was removed by phenol extraction, 
chloroform extraction and ethanol precipitation. The resulting linear DNA 7a, because it lacks phosphates 
on the protruding ends thereof, will in ligation accept only inserts whose complementary sticky ends are 
phosphorylated but will not itself recircularize, permitting more facile screening for plasmids containing 
the inserts. The EcoRI fragment derived from pHGH 207 and the linear DNA obtained from pBRHI were 

20 combined in the presence of T 4 ligase as previously described and ligated. A portion of the resulting 
mixture was transformed into E. coli strain 294 as previously described, plated on LB media containing 
5 ug/ml of tetracycline, and 12 tetracycline resistant colonies selected. Plasmid was isolated from each 
colony and examined for the presence of a DNA insert by restriction endonuclease analysis employing 
EcoRI and Xbal. One plasmid containing the insert was designated pHKY1. 

25 

IV. Creation of a plasmid containing the tryptophan operon capable of expressing a specifically cleavable 
fusion protein comprising 6 amino acids of the trp leader peptide and the last third of the trp E polypeptide 
(designated LE') and a heterologous structural gene product. 

The strategy- for the creation of a LE' fusion protein expression plasmid entailed the following steps: 

30 a. Provision of a gene fragment comprising codons for the distal region of the LE' polypeptide having 

Bgl II and EcoRI sticky ends respectively at the 5' and at the 3' ends of the coding strand; 

b. Elimination of the codons from the distal region of the LE' gene fragment and those for the trp D 
gene from plasmid SOM 7 A2 and insertion of the fragment formed in step 1, reconstituting the LE' codon 
. sequence immediately upstream from that for the heterologous gene for somatostatin. 

35 1. With reference to Figure 9(a), plasmid pSom7 A2 was Hind 111 digested followed by digestion with 

lambda exonuclease {a 5' to 3' exonuclease) under conditions chosen so as to digest beyond the Bgl II 
restriction site within the LE' encoding region. 20 |jg of Hind ill-digested pSom 7 A2 was dissolved in buffer 
[20 mM glycine buffer, pH 9.6, 1 mM MgCI 2 , 1 mM 0-mercaptoethanol]. The resulting mixture was treated 
with 5 units of lambda exonuclease for 60 minutes at room temperature. The reaction mixture obtained 

40 was then phenol extracted, chloroform extracted and ethanol precipitated. 

In order ultimately to create an EcoRI residue at the distal end of the LE' gene fragment a primer 
32 pCCTGTGCATGAT was synthesized by the improved phosphotriester method (R. Crea et af„ Proc Nat'l 
Acad Sci USA 75, 5765 [1978]) and hybridized to the single stranded end of the LE' gene fragment resulting 
from lambda exonuclease digestion. The hybridization was performed as next described. 

45 20 tig of the lambda exonuclease-treated Hind III digestion product of plasmid pSom7 A2 was 

dissolved in 20 ul H 2 0 and combined with 6 ul of a solution containing approximately 80 picomoles of the 
5'-phosphorylated oligonucleotide described above. The synthetic fragment was hybridized to the 3' end of 

the„LELcoding.sequence and the remaining_single^strand_portion-atthe .LE'-fragment was filled in by the 

Klenow polymerase I procedure described above, using dATP, dTTP, dGTP and dCTP. 

50 The reaction mixture was heated to 50°C and let cool slowly to 10°C, whereafter 4 ul of Klenow enzyme 

were added. After 15 minute room temperature incubation, followed by 30 minutes incubation at 37°C, the 
reaction was stopped by the addition of 5 ul of 0.25 molar EDTA. The reaction mixture was phenol 
extracted, chloroform extracted and ethanol precipitated. The DNA was subsequently cleaved with the 
restriction enzyme Bgl II. The fragments were separated by PAGE. An autoradiogram obtained from the gel 

55 revealed a 3a p-labelled fragment of the expected length of approximately 470 bp, which was recovered by 
electroelution. As outlined, this fragment LE'(d) has a Bgl 11 and a blunt end coinciding with the beginning 
of the primer. 

The plasmid pThal described in part I{C.) above carries a structural gene for thymosin alpha one cloned 
at its 5' coding strand end into an EcoRI site and at its 3' end into a BamHI site. As shown in Figure 9b, the 
50 thymosin gene contains a Bgl II site as well. 

Plasmid pThal also contains a gene specifying ampicillin resistance. In order to create a plasmid 
capable of accepting the LE'(d) fragment prepared above, pThal was EcoRI digested followed by Klenow 
polymerase I reaction with dTTP and dATP to blunt the EcoRI residues. Bgl II digestion of the resulting 
product created a linear DNA fragment 33 containing the gene for ampicillin resistance and, at its opposite 
es ends, a sticky Bgl II residue and a blunt end. The resulting product could be recircularized by reaction with 
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the LE'(d) fragment containing a Bgl II sticky end and a blunt end in the presence of T< ligase to form the 
plasmid pTrp24 (Fig. 9b). In doing so, an EcoRI site is recreated at the position where biunt end ligation 
occurred. 

With reference to Figure 10, successive digestion of pTrp24 with Bgl II and EcoRI, followed by PAGE 

s and electroelution yields a fragment having codons for the LE'(d) polypeptide with a Bgl II sticky end and an 
EcoRI sticky end adjacent its 3' coding terminus. The LE'(d) fragment 38 can be cloned into the Bgl II site of 
plasmid pSom7 As to form an LE' polypeptide/somatostatin fusion protein expressed under the control of 
the tryptophan promoter-operator, as shown in Figure 10. To do so requires (1) partial EcoRI digestion of 
pSom7As in order to cleave the EcoRI site distal to the tryptophan promoter-operator, as shown in Figure 

JO 10 and (2) proper choice of the primer sequence (Figure 9} in order to properly maintain thecodon reading 
frame, and to recreate an EcoRI cleavage site. 

Thus, 16 iJg plasmid pSom7 A2 was diluted into 200 ul of buffer containing 20 mM Tris, pH 7.5, 5 mM 
MgCI 2 , 0.02 NP40 detergent, 100 mM NaCI and treated with 0.5 units EcoRI. After 15 minutes at 37°C, the 
reaction mixture was phenol extracted, chloroform extracted and ethanol precipitated and subsequently 

is digested with Bgl II. The larger resulting fragment 36 isolated by the PAGE procedure followed by 
electroelution. This fragment contains the codons "LE'(p)" for the proximal end of the LE' polypeptide, ie, 
those upstream from the Bgl 11 site. The fragment 36 was next ligated to the fragment 38 in the presence of 
T 4 DNA ligase to form the plasmid pSom7 A2A4, which upon transformation into E. coli strain 294, as 
previously described, efficiently produced a fusion protein consisting of the fully reconstituted LE' 

20 polypeptide and somatostatin under the control of the tryptophan promoter-operator. The fusion protein, 
from which the somatostatin may be specifically cleaved owing to the presence of a methionine at the 5' 
end of the somatostatin sequence was segregated by SDS polyacrylamide gel electrophoresis as 
previously described. The fusion protein product is the most distinct band apparent in Lane 6 of Figure 11, 
discussed in greater detail in Part VI, infra. 

25 

V. Creation of an expression system for trp LE' polypeptide fusions wherein tetracycline resistance is 
placed under the control of the tryptophan promoter-operator. 

The strategy for creation of an expression vehicle capable of receiving a wide variety of heterologous 
polypeptide genes for expression as trp LE' fusion proteins under the control of the tryptophan operon 
3Q entailed construction of a piasmid having the following characteristics: 

1 . Tetracycline resistance which would be lost in the event of the promoter-operator system controlling 
the genes specifying such resistance was excised. 

2. Removing the promoter-operator system that controls tetracycline resistance, and recircularizing by 
ligation to a heterologous gene and a tryptophan promoter-operator system in propoer reading phase with 

35 reference thereto, thus restoring tetracycline resistance and accordingly permitting identification of 
plasmids containing the heterologous gene insert. 

In short, and consistent with the nature of the intended inserts, the object was to create a linear piece of 
DNA having a Pst residue at its 3' end and a Bgl II residue at its 5' end, bounding a gene capable of 
specifying tetracycline resistance when brought under the control of a promoter-operator system. 

40 Thus, with reference to figure 12, plasmid pBR322 was Hind III digested and the protruding Hind 111 
ends in turn digested with S1 nuclease. The S1 nuclease digestion involved treatment of 10 ug of Hind Ill- 
cleaved pBR322 in 30 u.1 SI buffer (0.3 M NaCI, 1 mM ZnCl 2/ 25 mM sodium acetate, pH 4.5} with 300 units 
S1 nuclease for 30 minutes at 15°C. The reaction was stopped by the addition of 1 ul of 30 x SI nuclease 
stop solution (0.8 M tris base, 50 mM EDTA). The mixture was phenol extracted, chloroform extracted and 

45 ethanol precipitated, then EcoRI digested as previously described and the large fragment 46 obtained by 
PAGE procedure followed by electroelution. The fragment obtained has a first EcoRI sticky end and a 
second, blunt end whose coding strand begins with the nucleotide thymidine. As will be subsequently 
shown, the SI -digested Hind HI residue beginning with thymidine can be joined to a Klenow polymerase I- 
treated-BgUI residue-so as to reconstitute the Bgl II restriction site-upon-ligation. — — — . 

so Plasmid pSom7 A2, as prepared in Part I above, was Bgl II digested and the Bgl II sticky ends resulting 

made double stranded with the Klenow polymerase i procedure using all four deoxy nucleotide 
triphosphates. EcoRI cleavage of the resulting product followed by PAGE and electroelution of the small 
fragment 42 yielded a linear piece of DNA containing the tryptophan promoter-operator and codons of the 
LE' "proximal" sequence upstream from the Bgl II site ("LE'(p)"). The product had an EcoRI end and a blunt 

55 end resulting from filling in the Bgl II site. However, the Bgl 11 site is reconstituted by ligation of the blunt 
end of fragment 42 to the blunt end of fragment 46, Thus, the two fragments were ligated in the presence of 
T 4 DNA ligase to form the recircularized plasmid pHKY 10 (see Figure 12) which was propagated by 
transformation into competent £ coli strain 294 cells. Tetracycline resistant cells bearing the recombinant 
plasmid pHKY 10 were grown up, plasmid DNA extracted and digested in turn with Bgl II and Pst followed 

60 by isolation by the PAGE procedure and electroelution of the large fragment, a linear piece of DNA having 
Pst and Bgl II sticky ends. This DNA fragment 49 contains the origin of replication and subsequently proved 
useful as a first component in the construction of plasmids where both the genes coding for trp LE' 
polypeptide fusion proteins and the tet resistance gene are controlled by the trp promoter/operator. 

Plasmid pSom7 A2A4, as previously prepared in Part IV, could be manipulated to provide a second 

65 component for a system capable of receiving a wide variety of heterologous structural genes. With 
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reference to Figure 13, the piasmid was subjected to partial EcoRI digestion {see Part IV) followed by Pst 
digestion and fragment 51 containing the trp promoter/operator was isolated by the PAGE procedure 
followed by electroelution. Partial EcoRI digestion was necessary to obtain a fragment which was cleaved 
adjacent to the 5' end of the somatostatin gene but not cleaved at the EcoRI site present between the 
5 ampiciilin resistance gene and the trp promoter operator. Ampicillin resistance lost by the Pst I cut in the 
ap R gene could be restored upon ligation with fragment 49. 

In a first demonstration the third component, a structural gene for thymosin alpha-one was obtained 
by EcoRI and Bam HI digestion \of_ piasmid pThal. The fragment, 52, was purified _by_ PAGE „and„ 
electroelution. 

w The three gene fragments 49, 51 and 52 could now be ligated simultaneously together in proper 
orientation according to the method disclosed in EP — Bl — 0036776, as depicted in Figure 13, to form the 
piasmid pTha7A1A4, which could be selected by reason of the restoration of ampiciilin and tetracycline 
resistance. The piasmid, when transformed into E. coli strain 294 and grown up under conditions like those 
described in Part l, expressed a trp LE' polypeptide fusion protein from which thymosin alpha one could be 
5 specifically cleaved by cyanogen bromide treatment. When other heterologous structural genes having 
EcoRI and BamHI termini were similarly ligated with the pHKYIO-derived and pSOM7 A2A4-derived 
components, trp LE' polypeptide fusion proteins containing the polypeptides for which those heterologous 
genes code were likewise efficiently obtained. Figure 11 illustrates an SOS polyacrylamide gel 
electrophoresis separation of total cellular protein from E. coli strain 294transformants, the darkest band in 

20 each case representing the fusion protein product produced under control of the tryptophan promoter- 
operator system. In Figure 11, Lane 1 is a control which segregates total cellular protein from 5. coli 294/ 
pBR322. Lane 2 contains the somatostatin fusion product from piasmid pSom7 A2A4 prepared in Part IV. 
Lane 3 is the somatostatin-containing expression product of pSom7 A1 A4. Lane 4 contains the expression 
product of pTha7A1 A4, whereas Lane 5 contains the product expressed from a piasmid obtained when the 

25 pHKY-10-derived and pSom7 A2A4-derived fragments discussed above were ligated with an EcoRI/BamHI 
. terminated structural gene encoding human proinsulin and prepared in part by certain of us. Lanes 6 and 7 
respectively contain, as the darkest band, a trp LE' polypeptide fusion protein from which can be cleaved 
the B and A chain of human insulin. The insulin B and A structural genes were obtained by EcoRI and 
BamHI digestion of plasmids plB1 and plA11 respectively, whose construction is disclosed in D. V. Goeddel 

30 et a/., Proc Nat'I Acad Sci USA 76, 106 [1979]. Lane 8 contains size markers, as before. 

While the invention in its most preferred embodiment is described with reference to E. coli, other 
enterobacteriaceae could likewise serve as host cells for expression and as sources for trp operons, among 
which may be mentioned as examples Salmonella typhimurium and Serratia marcesans. Thus, the 
invention is not to be limited to the preferred embodiments described, but only by the lawful scope of the * 

35 - appended claims. 

Claims 

1. A plasmidic expression vehicle for the production in bacteria of a heterologous polypeptide product, 
40* said vehicle having a sequence of double-stranded DNA comprising, in phase from a first 5' to a second 3' 
end of the coding strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 

(ii) nucleotides coding for a ribosome binding site for translation of element (iv); 

(in) nucleotides coding for a translation start signal for translation of element (iv); and 
45 (iv) a structural gene encoding the amino acid sequence of a heterologous polypeptide; 

said sequence comprising neither any trp attenuation capability nor nucleotides coding for the trp E 
ribosome binding site. 

Z, The vehicle of claim 1 whe rein the polypeptide expressed by sai d structural gene is entirely 
heterologous. 

so 3. The vehicle of claim 1 wherein the polypeptide expressed is a fusion protein comprising a 
heterologous polypeptide and at least a portion of the amino acid sequence of a homologous polypeptide. 

4. The vehicle of claim 3 wherein said portion is a portion of the amino acid sequence of an enzyme 
involved in the biosynthetic pathway from chorismic acid to tryptophan. 

5. The vehicle of claim 3 or claim 4 wherein the heterologous polypeptide is a bioactive polypeptide 
55 and the fused homologous polypeptide is a specifically cleavable bioinactivating polypeptide. 

6. The vehicle of claim 3 wherein the homologous polypeptide is the trp E polypeptide and wherein 
said ribosome binding site is the ribosome binding site for the trp leader polypeptide. 

7. The vehicle of claim 3 wherein the homologous polypeptide is the trp D polypeptide. 

8. The vehicle of claim 6 wherein the fusion protein comprises a heterologous polypeptide and a 
so homologous polypeptide which itself constitutes a fusion of about the first six amino acids of the trp leader 

polypeptide and the amino acid sequence encoded by at least about the distal third of the trp E polypeptide 
gene. 

9. The vehicle of any one of the preceding claims wherein the heterologous polypeptide comprises a 
recoverable polypeptide selected from human growth hormone, human proinsulin, somatostatin, 

65 thymosin alpha 1, the A chain of human insulin and the B chain of human insulin. 
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10. A method of producing a polypeptide product by the expression in bacteria of a structural gene 
coding therefor which comprises: 

(a) providing a bacterial inoculant transformed with a replicable plasmidic expression vehicle having a 
sequence of double-stranded DNA comprising, in phase from a first 5' to a second 3' end of the coding 

5 strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 

(ii) nucleotides coding for a ribosome binding site for translation of element (iv); 

(Hi) nucleotides coding for a translation start signal for translation of element (iv); and 
(iv) a structural gene encoding the amino" acid sequence of a heterologous polypeptide; 
10 said sequence comprising neither any trp attenuation capability nor nucleotides coding for the trp E 
ribosome binding site; 

(b) placing the transformed inoculant in a fermentation vessel and growing the same to a 
predetermined level in suitable nutrient media containing additive tryptophan sufficient in quantity to 
repress said promoter-operator system; and , 

is (c) depriving said bacteria of said additive so as to derepress said system and occasion the expression 
of the product for which said structural gene codes. 

1 1 . The method of claim 1 0 wherein tryptophan deprivation is effected by cessation of addition of said 
additive and by dilution of the fermentation media in which said inoculant is first grown up. 

12. The method of claim 11 wherein the host bacterium is £ co/L 

20 13. The method of any one of claims 10, 11 and 12 wherein the polypeptide expressed by said 

structural gene is entirely heterologous. . 

14. The method of any one of claims 10 to 13 wherein the polypeptide expressed is a fusion protein 
comprising a heterologous polypeptide and at least a portion of the amino acid sequence of a homologous 
polypeptide. 

25 15. The method of claim 14 wherein said portion is a portion of the amino acid sequence of an enzyme 

involved in the biosynthetic pathway from chorismic acid to tryptophan. 

16. The method of claim 14 or claim 15 wherein the heterologous polypeptide is a btoactive 
polypeptide and the fused homologous polypeptide is a specifically cleavable bioinactivating polypeptide. 

17. The method of claim 14 wherein the homologous polypeptide is the trp E polypeptide and wherein 
30 said ribosome binding site is the ribosome binding site for the trp leader polypeptide. 

18. The method of claim 14 wherein the homologous polypeptide is the trp D polypeptide. 

19. The method of claim 17 wherein the fusion protein comprises a heterologous polypeptide and a 
homologous polypeptide which itself constitutes a fusion of about the first six amino acids of the trp leader 
polypeptide and the amino acid sequence encoded by at least about the distal third of the trp E polypeptide 

35 gene. 

20. The method of any one of claims 10 to 19 wherein the heterologous polypeptide comprises a 
recoverable polypeptide selected from human growth hormone, human proinsulin, somatostatin, 
thymosin alpha 1, the A chain of human insulin and the B chain of human insulin. 

40 Patentanspruche 

1. Pfasmidisches Expressionsvehikel fur die Erzeugung eines heterologen Polypeptidproduktes in 
Bakterien, wobei das genannte Vehikel eine Sequenz von doppelstrangiger DNA aufweist, die in Phase von 
einem ersten 5' Ende zu einem zweiten 3' Ende des kodierenden Stranges derselben die Elemente 

45 (i) ein bakerielles trp Promotor-Operator-System; 

(ii) Nucleotide, die fur eine Ribosombindungsstelie fur die Translation des Elementes (iv) kodieren; 

(iii) Nucleotide, die fur ein Translationsstartsignal fur die Translation von Element (iv) kodieren und 

(iv) ein Strukturgen, das die Aminosauresequenz eines heterologen Polypeptids kodiert; 
aufweist, und die genannte Sequenz wedereine~trp^t:hwachungsfahigkeit noch Nucleotide enthalt, die fur 

50 die trp E Ribosombindungsstelie kodieren. 

2. Vehikel nach Anspruch 1, worin das durch das genannten Strukturgen exprimierte Poiypeptid 
vollstandig heterolog ist. 

3. Vehikel nach Anspruch 1, worin das exprimierte Poiypeptid ein Fusionsprotein ist, das ein 
heterologes Polypetid und wenigstens einen Abschnitt der Aminosauresequenz eines homologen 

55 Polypeptids enthalt. 

4. Vehikel nach Anspruch 3, worin der genannte Abschnitt ein Abschnitt der Aminosauresequenz eines 
Enzyms ist das an der biosynthetischen Entstehungsbahn von Chorisminsaure saure zu Tryptophan 
beteiligt ist. 

5. Vehikel nach Anspruch 3 oder 4, worin das heterologe Poiypeptid ein biologisch aktives Poiypeptid 
so und das verschmolzene homologe Poiypeptid ein spezifisch spaltbares, bioiogisch inaktivierendes 

Poiypeptid ist. 

6. Vehikel nach Anspruch 3, worin das homologe Poiypeptid das trp E Poiypeptid ist und worin die 
genannte Ribosombindungsstelie die Ribosombindungsstelie fur das trp Leitpolypeptid ist. 

7. Vehikel nach Anspruch 3, worin das homologe Poiypeptid das trp D Poiypeptid ist. 

6S 8. Vehikel nach Anspruch 6, worin das Fusionsprotein ein heterologes Polypetid und ein homologes 
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Polypeptid umfafct, das seinerseits eine Fusion von etwa den ersten 6 Aminosauren des trp Leitpolypeptids 
und der Aminosauresequenz darstellt, die von wenigstens etwa dem distalen Drittel des trp E 
Poiypeptidgens kodiert ist. 

9. Vehtkel nach einem der vorhergehenden Anspruche, worin das heterologe Polypeptid ein 
ruckgewinnbares Polypeptid enthalt, das aus menschlichem Wachstumshormon, Human-Proinsulin, 
Somatostatin, Thymosin Alpha 1 , der A-Kette in Humaninsulin und der B-Kette von Humaninsulin qewahlt 
ist. 

10. Verfahren zur Hersteiiung eines Polypeptidprqduktes durch Expression eines dafur kodierenden 
Strukturgens in Bakterien, umfassend: 

(a) Schaffung eines bakteriellen fmpfstoffs, der mit einem replizierbaren plasmidischen 
Expressionsvehikel transformiert ist, das eine Sequennz von doppelstrangiger DNA aufweist, die in Phase 
von einem ersten 5'-Ende zu einem zweiten 3'-Ende ihres kodierenden Stranges die Eiemente 

(i) ein bakterielles trp Promotor-Operator-System; 

(ii) Nucleotide, die fur eine Ribosornbindungsstelle fur die Translation des Elementes (iv) kodieren; 

(iii) Nucleotide, die fur ein Translationsstartsignal fur die Translation des Elementes (iv) kodieren; und 

(iv) ein Strukturgen, das die Aminosauresequenz eines heterologen Polypeptids kodiert; 

umfaSt, wobei die genannte Sequenz weder eine trp Schwachungsfahigkeit noch Nucleotide umfa&t, die 
fur die trp E Ribosornbindungsstelle kodieren; 

(b) Einbringen des transformierten Impfstoffs in ein FermentationsgefaS und Kultivieren desselben auf 
einen vorgegebenen Wert in geeigneten Nahrmedien, die additives Tryptophan in ausreichender Menge 
enthalten, um das genannte Promotor-Operator-System zu reprimieren; und 

(c) Verarmen der genannten Bakterien am genannten Additiv, um das genannte System zu 
dereprimieren und die Expression des Produktes zu veranlassen, fur das das genannte Strukturgen kodiert. 

11. Verfahren nach Anspruch 10, worin die Tryptophanabreicherung durch Einstellen der Zugabe des 
genannten Additivs und durch Verdunnung der Fermentationsmedien bewirkt wird, in denen der genannte 
Impfstoff zuerst kultiviert wird. 

12. Verfahren nach Anspruch 11, worin das Wirtsbacterium E. coli ist. 

13. Verfahren nach einem der Anspruche 10, 121 und 12, worin das vom genannten Strukturgen 
exprimierte Polypeptid vollstandig heterolog ist. 

14. Verfahren nach einem der Anspruche 10 bis 13, worin das exprimierte Polypeptid ein 
Fusionsprotein ist, das ein heterologes Polypetid und wenigstens einen Abschnitt der Aminosauresequenz 
eines homologen Polypeptids enthalt. 

1 5. Verfahren nach Anspruch 14, worin der genannte Abschnitt ein Abschnitt der Aminosauresequenz 
eines Enzyms ist, das an der biosynthetischen Entstehungsbahn von Chorisminsaure zu Tryptophan 
beteiligt ist. 

16. Verfahren nach Anspruch 14 oder 15, worin das heterologe Polypeptid ein hbioaktives Polypeptid 
und das verschmolzene homologe Polypetid ein spezifisch spaltbares, inaktivierendes Polypeptid ist. 

17. Verfahren nach Anspruch 14, worin das homologe Polypeptid das trp E Polypeptid ist und worin die 
genannte Ribosornbindungsstelle die Ribosornbindungsstelle fur das trp Leitpolypeptid ist. 

18. Verfahren nach Anspruch 14, worin das homologe Polypeptid das trp D Polypeptid ist. 

19. Verfahren nach Anspruch 17, worin das Fusionsprotein ein heterologes Polypeptid und ein 
homologes Polypeptid enthalt, das selbst eine Verschmelzung von etwa den ersten 6 Aminosauren des trp 
Leitpolypeptids und der Aminosauresequenz darstellt, die von wenigstens etwa dem distalen Drittel des trp 
E Poiypeptidgens kodiert wird. 

20. Verfahren nach einem der Anspruche 10 bis 19, worin das heterologe Polypeptid ein 
ruckgewinnbares Polypeptid enthalt, das aus menschlichem Wachstumshormon, Human-Proinsulin, 
Somatostatin, Thymosin Alpha 1, der A-Kette von Human-Insulin und der B-Kette von Human-Insulin 
gewahit ist. 



Revendications 

1. Vecteur d'expression plasmidique pour la production, chez des bacteries, d'un polypeptide 
heteroiogue produit, ledit vecteur ayant une sequence d'ADN a deux brins comprenant, en phase, d'une 
premiere extremite 5' a une seconde extremite 3' de son brin codant, ies elements: 

(i) un systeme promoteur-operateur de trp bacterien; 

(ii) des nucleotides codant pour un site liant le ribosome pour une traduction de I'element (iv); 

(iii) des nucleotides codant pour un signal de debut de traduction pour une traduction de I'element (iv); 

et 

(iv) un gene de structure codant la sequence d'acides amines d'un polypeptide heteroiogue; 

ladite sequence ne comprenant ni capacite d'attenuation de trp ni nucleotides codant pour le site liant le 
ribosome de trp E. 

2. Vecteur de la revendication 1 ou le polypeptide exprime par ledit gene de structure est totaiement 
heteroiogue. 

3. Vecteur de !a revendication 1 ou te polypeptide exprime est une proteine de fusion comprenant un 
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polypeptide heterologue et au moins une portion de la sequence d'acides amines d'un polypeptide 
homologue. 

4. Vecteur de la revendication 3 od ladite portion est une portion de la sequence d'acides amines d'un 
enzyme mis en cause dans le trajet biosynthetique de I'acide chorismique au tryptophane. 
5 5. Vecteur. de la revendication 3 ou la revendication 4 ou le polypeptide heterologue est un polypeptide 
bioactif et le polypeptide homologue fusionne est un polypeptide bio-inactivant specifiquement scindable. 

6. Vecteur de la revendication 3 ou le polypeptide homologue est le polypeptide de trp E et ou ledit site 
liant le ribosome est le site liant le ribosome pour le polypeptide leader de trp. 

7. Vecteur de la revendication 3 ou le polypeptide homologue est le polypeptide D de trp. 

jo 8. Vecteur de la revendication 6 ou la proteine de fusion comprend un polypeptide heterologue et un 
polypeptide homologue qui constitue lui-meme une fusion des six premiers acides amines, environ, du 
polypeptide leader de trp et la sequence d'acides amines codee par au moins environ le tiers distal du gene 
du polypeptide E de trp, 

9. Vecteur selon I'une quelconque des revendications precedentes ou ie polypeptide heterologue 
is comprend un polypeptide recuperable choisi parmi I'hormone de la croissance humaine, la proinsuline 

humaine, la somatostatine, I'alpha thymosine 1, la chaine A de 1'insuline humaine et la chalne B de 
I'insuline humaine. 

10. Precede de production d'un polypeptide par 1'expression, chez des bacteries, d'un gene de 
structure le codant qui consiste a: 

20 (a) prevoir un inoculant bacterien transforme par un vecteur d'expression plasmidique replicable ayant 
une sequence d'ADN a deux brins comprenant, en phase, d'une premiere extremite 5' a une seconde 
extremite 3' de son brin de codage, les elements: 

(i) un systeme promoteur-operateur de trp bacterien; 

(ii) des nucleotides codant pour un site liant ie ribosome pour la traduction de I'eiement (iv); 

25 (iii) des nucleotides codant pour un signal de debut de traduction pour la traduction de i'eiement (iv); et 
(iv) un gene de structure codant la sequence d'acides amines d'un polypeptide heterologue; 
ladite sequence ne comprenant ni capacite d'attenuation de trp ni nucleotides codant pour le site liant le 
ribosome de trp E; 

(b) placer J'inoculant transforme dans un recipient de fermentation et le faire croitre jusqu'a un niveau 
30 predetermine dans des milieux nutritifs appropries contenant du tryptophane additif en une quantite 

suffisante pour reprimer ledit systeme promoteur-operateur; et 

(c) priver ladite bacterie dudit additif afin de dereprimer ledit systeme et d'occasionner I'expression du 
produit pour lequel code ledit gene de structure. 

11. Procede de la revendication 10 ou Ea privation du tryptophane est effectuee par arret de ('addition 
35 dudit additif et par dilution des milieux de fermentation ou Ton fait d'abord crottre ledit inoculant. 

12. Procede de la revendication 11 ou la bacterieu note est £. coli. 

13. Procede selon I'une des revendications 10, 11 et 12 ou le polypeptide exprime par iedit gene de 
structure est totalement heterologue. 

14. Procede selon I'une des revendications 10 a 13 ou le polypeptide exprim£ est une proteine de 
40 fusion comprenant un polypeptide heterologue et au moins une portion de la sequence d'acides amines 

d'un polypeptide homologue. 

15. Procede de ia revendication 14 ou ladite portion est une portion de la sequence d'acides amines 
d'un enzyme implique dans le trajet biosynthetique de I'acide chorismique au tryptophane. 

16. Procede de la revendication 14 ou de la revendication 15 ou le polypeptide heterologue est un 
45 polypeptyde bioactif et la polypeptide homologue fusionne est un polypeptide bio-inactivant 

specifiquement scindable. 

17. Procede de la revendication 14 ou le polypeptide homologue est le polypeptide de trp E et ou ledit 
site l iant le ribosome est le site liant le ribosome pour le polypeptide leader de trp. 

187ProcedeTde riaH-everidi cation 14 ou le polypeptide"homoiogue _ est ie polypeptide de trp D. 
so 19. Procede de la revendication 17 ou la proteine de fusion comprend un polypeptide heterologue et un 

polypeptide homologue qui constitue lui-meme une fusion des six premiers acides amines environ du 
polypeptide leader de trp et la sequence d'acides amines codee par au moins environ le tiers distal du gene 
du polypeptide trp E. 

20. Procede selon I'une des revendications 10 a 19 ou le polypeptide heterologue comprend un 
55 polypeptide recuperable choisi parmi I'hormone de la croissance humaine, la proinsuline humaine, la 
somatostatine, I'alpha 1 thymosine, la chame A de I'insuline humaine et la chame B de I'insuline humaine. 
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